Abstract

There are many articles discussing the solution of boundary value prob- 
lems on various parallel machines. The solution of initial value problems 
does not lend itself to parallelism, since in this case one uses methods that 
are sequential in nature. 

Here we develop a parallel scheme for initial value problems based on 
the box scheme and a modified recursive doubling technique. 

Fully implicit Runge-Kutta methods were discussed by Jackson and
Norsett (1986) and Lie (1987). Lie assumes that each processor of the
parallel computer has vector capabilities.



1 Introduction 

We consider the solution of linear initial value problems on a hypercube. “By a 
hypercube we intend a distributed memory MIMD computer with communica- 
tion between processors ... via a communication network having the topology of 
a p-dimensional cube, with the vertices considered as processors and the edges 
as communication links” (Keller and Nelson, 1987). See also Fox (1984, 1985, 
1987) and Fox and Otto (1984). Our method of solution is based on the box
scheme to discretize the system of initial value problems 

$$ y' = Ay + f(x), $$

$$ y(a) = y^0, $$

where $y$ and $f$ are $n$-dimensional vectors and $A$ is an $n \times n$ matrix. The resulting
system of equations is solved by a modified version of the recursive doubling 
technique (see Stone, 1973). 

In the next section, the discretization is described and the resulting system
of equations is given. Section 3 describes the modified recursive doubling
technique and its application to our system.

It will be interesting to experiment with the method and compare the results 
to a sequential initial value solver of the same order. 



2 The Single Step Method 



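Consider the system of initial value problems

$$ y' = A(x)\, y + f(x), \qquad a < x < b, $$
$$ y(a) = y^0, \tag{1} $$

where

$$ y = (y_1, \ldots, y_n)^T, \qquad f = (f_1(x), \ldots, f_n(x))^T, \qquad y^0 = (y_{10}, \ldots, y_{n0})^T, $$

and

$$ A = (a_{ij}(x)), \qquad 1 \le i, j \le n. $$

Let

$$ x_j = a + jh, \qquad j = 0, 1, \ldots, m, \tag{2} $$

where

$$ h = \frac{b - a}{m}, \tag{3} $$

be a uniform mesh. The box scheme (see e.g. Keller, 1976), applied to (1) yields

$$ y_{j+1} = y_j + h \left\{ A_{j+1/2}\, \frac{y_{j+1} + y_j}{2} + f_{j+1/2} \right\}, \qquad j = 0, 1, \ldots, m - 1, $$
$$ y_0 = y^0, \tag{4} $$

where

$$ A_{j+1/2} = A\left(x_j + \tfrac{h}{2}\right), \qquad f_{j+1/2} = f\left(x_j + \tfrac{h}{2}\right), $$

and $y_j$ is the approximation to $y(x_j)$.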
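
As an illustration only, the following Python sketch applies the box scheme sequentially to a scalar problem ($n = 1$). Solving the implicit step for $y_{j+1}$ gives $y_{j+1} = [(1 + \frac{h}{2} a_{j+1/2})\, y_j + h f_{j+1/2}] / (1 - \frac{h}{2} a_{j+1/2})$. The names `box_step` and `box_solve` are hypothetical, and the restriction to a scalar equation is an assumption made to keep the sketch free of linear algebra libraries.

```python
import math


def box_step(a, f, x, y, h):
    """Advance y' = a(x) y + f(x) by one box-scheme step from x to x + h."""
    xm = x + 0.5 * h                  # midpoint x_{j+1/2}
    left = 1.0 - 0.5 * h * a(xm)      # scalar analogue of I - (h/2) A_{j+1/2}
    right = 1.0 + 0.5 * h * a(xm)     # scalar analogue of I + (h/2) A_{j+1/2}
    if left == 0.0:
        raise ZeroDivisionError("h is too large: 1 - (h/2) a(x) vanishes")
    return (right * y + h * f(xm)) / left


def box_solve(a, f, x0, xend, y0, m):
    """Return [y_0, ..., y_m] on the uniform mesh x_j = x0 + j h, h = (xend - x0) / m."""
    h = (xend - x0) / m
    ys = [y0]
    for j in range(m):                # inherently sequential: y_{j+1} needs y_j
        ys.append(box_step(a, f, x0 + j * h, ys[-1], h))
    return ys
```

For instance, `box_solve(lambda x: -1.0, lambda x: 0.0, 0.0, 1.0, 1.0, 100)[-1]` approximates $e^{-1}$. Since the box scheme is second-order accurate, the error should shrink roughly like $h^2$ as $m$ grows.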

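Let $\{j_t,\ t = 1, 2, \ldots, s\}$ be a strictly increasing sequence such that $j_1 > 0$
and $j_s = m$. We shall compute the solution at the points $x_{j_t}$. Let $\Phi_t$ be
$n \times n$ matrices defined for each $t$ by

$$ \Phi_t = \prod_{j=j_{t-1}}^{j_t - 1} \left(I - \tfrac{h}{2} A_{j+1/2}\right)^{-1} \left(I + \tfrac{h}{2} A_{j+1/2}\right), \qquad t = 1, 2, \ldots, s, \tag{5} $$

where $j_0 = 0$, the factors are ordered so that later steps (larger $j$) multiply from the
left, and $h$ is sufficiently small so that the matrices $I - \frac{h}{2} A_{j+1/2}$ are nonsingular.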

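Similarly let the $n$-vector $\varphi_t$ be

$$ \varphi_t = \left(I - \tfrac{h}{2} A_{j_t - 1/2}\right)^{-1} \left[ \left(I + \tfrac{h}{2} A_{j_t - 1/2}\right) \tilde{y}_{j_t - j_{t-1} - 1} + h f_{j_t - 1/2} \right], \qquad t = 1, 2, \ldots, s, \tag{6} $$

where

$$ \tilde{y}_{j+1} = \left(I - \tfrac{h}{2} A_{j_{t-1} + j + 1/2}\right)^{-1} \left[ \left(I + \tfrac{h}{2} A_{j_{t-1} + j + 1/2}\right) \tilde{y}_j + h f_{j_{t-1} + j + 1/2} \right], \qquad j = 0, \ldots, j_t - j_{t-1} - 2, \tag{7} $$

and

$$ \tilde{y}_0 = 0. \tag{8} $$

Then it can be easily shown, as in Keller and Nelson (1987), that

$$ y_{j_t} = \Phi_t\, y_{j_{t-1}} + \varphi_t, \qquad t = 1, 2, \ldots, s. \tag{9} $$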
Remarks



1. The matrices to be inverted are of order n, the number of equations in the 
original system (1). 

2. The last factor in the product defining $\Phi_t$ is the matrix required in
computing $\varphi_t$.

3. The vector $\varphi_t$ can be computed by (7) - (8) in the same loop in which one
computes $\Phi_t$, since it requires the same matrices.
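
Remark 3 can be sketched in Python as follows, again for a scalar equation ($n = 1$) so that no matrix library is needed. One loop over the box-scheme steps of a block accumulates both the propagator $\Phi_t$ and the vector $\varphi_t$, the latter by stepping from a zero starting value. The names `block_propagator` and `all_blocks` are hypothetical, and the scalar setting is an assumption of this sketch.

```python
def block_propagator(a, f, x_start, h, steps):
    """Return (Phi, phi) for `steps` box-scheme steps starting at x_start.

    Scalar sketch: the value after the block is Phi * y_start + phi.
    """
    Phi, phi = 1.0, 0.0               # identity propagator, zero starting value
    for j in range(steps):
        xm = x_start + (j + 0.5) * h  # midpoint of the j-th step in the block
        left = 1.0 - 0.5 * h * a(xm)  # assumed nonzero (h sufficiently small)
        right = 1.0 + 0.5 * h * a(xm)
        Phi = (right / left) * Phi    # newest factor multiplies from the left
        phi = (right * phi + h * f(xm)) / left
    return Phi, phi


def all_blocks(a, f, x0, h, breakpoints):
    """breakpoints = [j_1, ..., j_s], strictly increasing, with j_0 = 0 implied.

    Each block is independent of the others, so in a parallel setting every
    block could be assigned to its own processor.
    """
    blocks, prev = [], 0
    for jt in breakpoints:
        blocks.append(block_propagator(a, f, x0 + prev * h, h, jt - prev))
        prev = jt
    return blocks
```

Chaining the returned pairs, `y = Phi * y + phi` for each block in order, should reproduce the values the sequential box scheme gives at the selected mesh points, up to rounding.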

3 Parallel Evaluation 

To solve (9) on a hypercube with p = s processors, one can modify the recursive 
doubling technique developed by Stone (1973). 

Let

$$ b_1 = \Phi_1 y_0 + \varphi_1, $$
$$ b_j = \varphi_j, \qquad j = 2, 3, \ldots, s, \tag{10} $$

and let $Y_i(j)$ be a function of $b_j, b_{j-1}, \ldots, b_{j-i+1}, \Phi_j, \ldots, \Phi_{j-i+2}$. Then the
following results can be proved using arguments similar to those in Stone (1973).
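Theorem. Let $Y_i(j)$ satisfy the recurrence relation

$$ Y_{i+1}(j) = Y_1(j) + \Phi_j\, Y_i(j - 1), \qquad i, j \ge 1, \tag{11} $$

with boundary conditions

$$ Y_1(j) = b_j, \qquad j \ge 1, $$
$$ Y_i(j) = 0, \qquad j \le 0 \ \text{or}\ i \le 0. \tag{12} $$

Then

(i)

$$ Y_{i+s}(j) = Y_s(j) + \Bigl( \prod_{k=j-s+1}^{j} \Phi_{2j-k+1-s} \Bigr) Y_i(j - s), \tag{13} $$

(ii)

$$ Y_i(j) = \sum_{k=1}^{j} \Bigl( \prod_{l=k+1}^{j} \Phi_{j+k+1-l} \Bigr) b_k, \qquad i \ge j \ge 1, \tag{14} $$

(iii) for $i \ge j \ge 1$,

$$ Y_i(j) = y_{j_j}. \tag{15} $$

Corollary

$$ Y_{2i}(j) = Y_i(j) + \Bigl( \prod_{k=j-i+1}^{j} \Phi_{2j-k+1-i} \Bigr) Y_i(j - i), \qquad i, j \ge 1. \tag{16} $$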

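This corollary provides the recursive doubling algorithm for the solution of
(9). Let

$$ M_i(j) = \begin{cases} \prod_{k=1}^{j} \Phi_{j-k+1}, & j \le i, \\ \prod_{k=j-i+1}^{j} \Phi_{2j-k+1-i}, & j > i, \end{cases} $$

then (16) can be written as

$$ Y_{2i}(j) = Y_i(j) + M_i(j)\, Y_i(j - i), \qquad i, j \ge 1, $$
$$ M_{2i}(j) = M_i(j)\, M_i(j - i), \qquad i, j \ge 1, \tag{17} $$

with boundary conditions

$$ M_1(j) = \Phi_j, \qquad j \ge 1, $$
$$ M_i(j) = I, \qquad i \le 0 \ \text{or}\ j \le 0. \tag{18} $$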
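
To make the doubling step concrete, here is a worked instance for $s = 4$, obtained by unrolling the relations $Y_{2i}(j) = Y_i(j) + M_i(j)\, Y_i(j-i)$ and $M_{2i}(j) = M_i(j)\, M_i(j-i)$ from the starting values $Y_1(j) = b_j$ and $M_1(j) = \Phi_j$. It is only a sketch for checking the indices, not part of the original derivation.

```latex
% Step i = 1: j = 2, 3, 4 are updated in parallel;
% Y_2(1) = b_1 and M_2(1) = \Phi_1 carry over unchanged.
\begin{align*}
Y_2(j) &= b_j + \Phi_j b_{j-1}, & M_2(j) &= \Phi_j \Phi_{j-1}, & j &= 2, 3, 4.
\end{align*}
% Step i = 2: j = 3, 4 are updated in parallel.
\begin{align*}
Y_4(3) &= Y_2(3) + M_2(3)\, Y_2(1) = b_3 + \Phi_3 b_2 + \Phi_3 \Phi_2 b_1, \\
Y_4(4) &= Y_2(4) + M_2(4)\, Y_2(2) = b_4 + \Phi_4 b_3 + \Phi_4 \Phi_3 b_2 + \Phi_4 \Phi_3 \Phi_2 b_1.
\end{align*}
```

With $b_1 = \Phi_1 y_0 + \varphi_1$ and $b_j = \varphi_j$ otherwise, these expressions agree with what one obtains by applying $y_{j_t} = \Phi_t\, y_{j_{t-1}} + \varphi_t$ four times in sequence, but they are reached in two parallel steps instead of four sequential ones.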

We are now ready to state the algorithm. 

Algorithm

For $i = 1$ to $s/2$ in steps of $i$ (that is, $i = 1, 2, 4, \ldots, s/2$) do:

$$ Y_{2i}(j) = Y_i(j) + M_i(j)\, Y_i(j - i), \qquad i < j \le s, $$
$$ M_{2i}(j) = M_i(j)\, M_i(j - i), \qquad i < j \le s, \tag{19} $$

Next $i$.



From our theorem, $Y_s(j) = y_{j_j}$ for $1 \le j \le s$, so that $Y_s$ is the solution of (9).
We note that for each $i$, the indices pertaining to $j$ are executed simultaneously
on $s$ processors. Since $i$ doubles during each iteration, $\log_2 s$ iterations are
required for the computation.
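
The doubling loop can be simulated serially in Python as a check on the index bookkeeping. This is a minimal sketch under stated assumptions, not a hypercube implementation: the helper names (`mat_mul`, `mat_vec`, `vec_add`, `sequential`, `recursive_doubling`) are hypothetical, matrices are plain nested lists, and the inner loop over `j` stands in for the work that would run simultaneously on separate processors. The loop condition `i < s` is used instead of `i <= s/2` so that values of $s$ that are not powers of two are also handled.

```python
def mat_mul(A, B):
    """Product of two matrices stored as nested lists."""
    return [[sum(A[r][k] * B[k][c] for k in range(len(B)))
             for c in range(len(B[0]))] for r in range(len(A))]


def mat_vec(A, v):
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def vec_add(u, v):
    return [a + b for a, b in zip(u, v)]


def sequential(Phi, phi, y0):
    """Reference solution: apply y_t = Phi_t y_{t-1} + phi_t one step at a time."""
    out, y = [], y0
    for P, p in zip(Phi, phi):
        y = vec_add(mat_vec(P, y), p)
        out.append(y)
    return out


def recursive_doubling(Phi, phi, y0):
    """Serial simulation of the doubling loop; returns [y_1, ..., y_s]."""
    s = len(Phi)
    # Y[j] and M[j] hold Y_i(j) and M_i(j) for the current i; index 0 is unused.
    Y = [None] + [list(p) for p in phi]
    Y[1] = vec_add(mat_vec(Phi[0], y0), phi[0])         # b_1 = Phi_1 y_0 + phi_1
    M = [None] + [[row[:] for row in P] for P in Phi]   # M_1(j) = Phi_j
    i = 1
    while i < s:
        new_Y, new_M = Y[:], M[:]       # entries with j <= i carry over unchanged
        for j in range(i + 1, s + 1):   # conceptually one processor per j
            new_Y[j] = vec_add(Y[j], mat_vec(M[j], Y[j - i]))
            new_M[j] = mat_mul(M[j], M[j - i])
        Y, M = new_Y, new_M
        i *= 2                          # i doubles each pass
    return Y[1:]
```

Both functions should return the same list for any input. The updates are written into fresh lists so that every `j` reads the values from the previous pass, which mirrors the simultaneous execution assumed in the text.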